Korean prosodic break index labelling by a new mixed method of LDA and VQ

نویسندگان

  • Pyungsu Kang
  • Jiyoung Kang
  • Jinyoung Kim
چکیده

We present a new mixed method of LDA-VQ to predict Korean prosodic break index(PBI) for a given utterance. PBI can be used as an important cue of syntactic discontinuity in continuous speech recognition(CSR). Our proposed method, LDA-VQ model, consists of three steps. At the first step, PBI was predicted with the information of syllable and pause duration through the linear discriminant analysis(LDA) method. At the second step, syllable tone information was used to estimate PBI. In this step we used vector quantization(VQ) for coding the syllable tones and PBI is estimated by tri-tone model. In the last step, two PBI predictors were integrated by a weight factor. The LDA-VQ method was tested on 200 literal style spoken sentences. The experimental results showed 72% accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Asthma in Iranian Schoolchildren: Comparison of ISAAC Video and Written Questionnaires

Background: The international study of asthma and allergies in childhood (ISAAC) is used to define the prevalence and severity of asthma in different regions. In this study we followed the performance of the ISAAC video and written questionnaires (VQ and WQ) to classify asthma in 13-14 yr-old schoolchildren. Methods: The present study was carried out on 3540 schoolchildren 13 to 14-yrs-old us...

متن کامل

Determining prominence and prosodic boundaries in Korean by non-expert rapid prosody transcription

This paper examines how non-expert listeners perceive prominence and prosodic boundaries in Korean using the Rapid Prosody Transcription (RPT) method, developed by Mo, Cole and Lee [9] for American English. While prominence is used to mark prosodically salient or “highlighted” words and phrases, prosodic boundaries demarcate units or “chunks” of speech to mirror the hierarchical relations among...

متن کامل

Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder

Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).

متن کامل

یک مدل موضوعی احتمالاتی مبتنی بر روابط محلّی واژگان در پنجره‌های هم‌پوشان

A probabilistic topic model assumes that documents are generated through a process involving topics and then tries to reverse this process, given the documents and extract topics. A topic is usually assumed to be a distribution over words. LDA is one of the first and most popular topic models introduced so far. In the document generation process assumed by LDA, each document is a distribution o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998